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Abstract 

In this thesis, parallel algorithms have been obtained for some problems on graphs. 

1. It is shown that the problems of computing Maximum Independent Set, Min- 
imum Dominating Set and Minimum Clique Cover on Interval Graphs can be 
solved in 0(1) time on a 7i x n reconfigurable mesh. 

2. A parallel algorithm requiring O(logjJi) time using m processors on the CREW 
PRAM is obtained for the most vital edge (MVE) problem. G — {V,E) is a 
weighted undirected graph with n vertices and m edges, each edge c having a 
weight w{e) assigned to it. If f{G) is the weight of a minimum spanning tree of G 
the MVE of G is an edge e such that /(G— e) > /(G — e')for every other edge e' 
of G. Sequential algorithm for the problem takes 0(min(m -b n log n, ma{m, n)) 
time[18]. 

3. A parallel solution is obtained for computing a DFS tree in a permutation graph. 
The algorithm takes 0{\og^n) time using n processors on a CREW PRAM or 

processors on an ARBITRARY- WRITE PRAM. 

4. A parallel solution is obtained for computing a BPS tree in a permutation graph. 
The algorithm takes O(logn) optimal time using processors on an EREW 
PRAM. 

5. Optimal parallel algorithms are described for computing the connected and 
biconnected components of a permutation graph. The algorithms can be imple- 
mented in G(logloglogn) time with iogiogiog ' n Processors on a COMMON (or 
TOLERANT) CRCW PRAM, or alternatively in G(logn) time with pro- 
cessors on a CREW PRAM. Algorithm for connected components can also be 
implemented in G(Iog* n) time with processors on a Priority- Write PRAM. 
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Chapter 1 


Introduction 


A reference to an algorithm normally implies a sequential algorithm, i.e a single 
instruction can be executed in an instant of time. In recent years, there has been 
an increasing awareness of the bottleneck in sequential computing, namely physical 
laws (minimum size, minimum distance between transistors) . which determine the 
maximum speed a sequential computer (based on transistors as the basic computing 
element) can achieve. 

To overcome this bottleneck, i.e obtain faster running times, the concept of parallel 
computing has been introduced as a viable alternative. The central idea is to execute 
multiple steps of the given algorithm simultaneously; this expectedly reduces the total 
computation time. A parallel computer does this by using multiple processing units 
(processors). A problem to be solved is partitioned into subproblems which are then 
assigned to different processors so that they can be executed simultaneously; once 
these computations are over, the solutions are combined to obtain the solution to the 
original problem. A parallel algorithm is a solution method for solving problems on a 
parallel computer [3]. 

In this thesis, parallel algorithms are presented to solve some problems belonging 
to an important class, namely graphs. Graphs were first introduced by Euler in 
1736 and, since then have found innumerable applications in almost all domains of 
Engineering and Science [12]. In the next setion, we take a look at some graph- 
theoretic preliminaries where we describe some common terms with which the reader 
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will be assumed to be familiar throughout the rest of this thesis. 


1.1 Graph-theoretic preliminaries 

The definitions here are mostly consistent with [14]. 

• Graph A graph G =■ {V,E) consists of a finite set of vertices V and a finite 
set of edges E where each edge is an ordered pair, (x,y), x,y E V. The graph 
G is said to be undirected if the edge relationship is symmetric, i.e {x,y) G 
E 4^ (y,®) € E. Subsequently, the term graph refers to undirected graphs 
only. A subgraph of G = {V, E) is any graph G' = {V',E') where V' C 1/ and 
E' C Ef]{V' xV). 

• Isomorphism Two graphs G = {V,E) and G' = {V%E') are called isomor- 
phic, if there is a bijcction f : V V such that Va:,y G V, (a:,y) e E <=¥ 

im.m) e E-. 

• Complement The complement of the graph G = {V,E) is the graph (7' = 
{V,E') where E' = {(x,y) G V x K and (x,y) ^ 

• Path A path between two vertices vj and u/ in G is a sequence of vertices 

[vi,U 2 , such that £ E Vi G [!..(/ - 1)]. 

• Cycle A cycle is a sequence of vertices [ui, such that (u,-, v,+i) G 

E Vi €[!..(/ -1)1 and (i;,,t;i)G£?. 

• Connected graph A graph is connected if there is a path between any two 
vertices in the graph. 

• Tree A tree T = (V, J?) is a connected acyclic graph. 

• Minimum spanning tree Let G = (V, E) be an undirected connected graph 
where each edge has a weight assigned to it. A subgraph G' = (V,E') is a 
minimum spanning tree iff G' is a tree and the sum of the weights of edges 
belonging to G' is minimum among all such trees. 

• Clique A clique in G is a subset V' of the vertex set V of G such that there 
is an edge between every pair of vertices in V, i.e x,y £ V' ^ (2^,y) € E. 
A clique is maximal if no other clique of G properly contains it. A clique is 



maximum if the cardinality (number of vertices) of the clique is not less than 
the cardinality of any other clique of G; this cardinality is denoted as A 

minimum clique cover is a partition of the vertices in V such that each subset 
is a clique; additionally, the number of subsets is minimum- this number is 
denoted as A:(G). 

• Independent set An independent set in G is a subset V' of the vertex set V 
of G such that there is no edge between any pair of vertices in i.e x,y €. 
V => (ic, y) ^ E. The number of vertices in an independent set of maximum 
cardinality is denoted as q;(G). A colouring is a partition of the vertices in V 
such that each subset is an independent set. The chromatic number of G is 
a colouring in which the number of such subsets is minimum- this number is 
denoted as x(G). 

• Perfect graph In any graph, a)(G) < x(<^)- class of undirected graphs 
for which u}{G) = x(^) is termed as the class of perfect graphs. This is a well- 
studied class of graphs [6] and important from a practical view-point. Note that 
many graph problems which are NP-complete for general graphs, e.g, finding 
lb(G), are solvable in polynomial time for graphs belonging to subclasses of this 
class; even linear time algorithms are possible, e.g for interval graphs. 

In the next two subsections, we consider two important subclasses of perfect graphs 
for which parallel algorithms are dc.signcd in this thesis. 

1.1.1 Permutation graphs 

We consider a permutation tt of the n integers in the range [l..n]. For an integer i, 
7r“*(i) denotes the po.sition in tt where i occurs. The graph G{'k) = {V,E) is defined 
in the following manner: for each integer i G [l..n] there is a corresponding vertex in 
V and an edge (z, j) € £■ iff (z - j)(7r"^(z) - 7r“’(j)) < 0. An undirected graph G is 
called a permutation graph iff there exists G{Tr) for some r, such that G is isomorphic 
to G(x). 

A common representation of permutation graphs is the matching diagram [14]. 
This consists of two rows of integers; the upper row, U is the sequence of all integers 
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Fig la. Permutation graph Fig lb: Matching diagram 

in the range [L.n] while the lower row D is the given permutation, tt. Each integer 
i in t/ is connected by a straight line to that position j in D such that 7r{j) = z; 
thus we may view the graph as a bijective mapping of integers from U to D] an edge 
(i, y) is in the graph if and only if the lines i and j intersect in the matching diagram. 
In Fig 1 we have an example showing the permutation graph and matching diagram 
corresponding to x = (3,1,2,4,8,7, 5, 6). 

This class of graphs has been well-studied [13] and is an important subclass of 
perfect graphs with a large number of practical applications. Before proceeding to 
give an example, we recall some of the important properties of graphs belonging 
to this class: (a)they are transitively orientable, i.e each edge can be assigned a 
direction such that the resulting oriented graph (V, F) satisfies the following property: 
{x,y),{y,z) € F (a:,2) € F, where x,y,z € V (b) their complement also belongs 
to this class. 

Application Consider a set of cities which can be partitioned into two subsets Si 
and $2 such that all cities in Si (also 52 ) He along a straight line. There are flights 
between cities in Si and S 2 scheduled at the same time. The problem is to assign 
altitudes to each flight path so that intersecting paths are at different altitudes. 

To solve this problem, a bipartite graph is constructed with Si and S 2 as the two 
parts; corresponding to each city there is a vertex, and there is an edge between a 
vertex in Si and a vertex in S 2 if there is a scheduled flight between the two cities 



corresponding to the vertices. From this bipartite gra[)li, a matching diagram [14] can 
be easily extracted (by splitting each vertex into multiple vertices, one for each edge 
incident on it), i.e the given problem can be framed as a permutation graph problem. 
The modified problem is to compute a colouring of the given graph, so that vertices 
which have an edge connecting them (or flight paths intersect) are assigned different 
colours (altitudes). 

1.1.2 Interval graphs 

Consider a set of intervals on the real line (or in general, any linearly ordered set). 
Corresponding to each interval, there is a vertex and two such vertices have an edge 
between them iff their corresponding intervals intersect. Such a graph is called an 
interval graph. To put it slightly more formally, if the interval is represented as 
h = (o. < 6,), there is an edge in the interval graph between vertices /,• and 

Ij iff either Cj < a,- < bj or aj < bi < bj^ where we make the simplifying assumption 
(without any loss of generality) that each ai,bi,aj,bj is distinct. 

This is an important subclass of perfect graphs with many applications in practical 
domains. As an example we consider the following application [14]. 

Application There are n medicines, Mi,M 2 ,..Mn which need to be stored at suitable 
temperature levels; to be more exact, the temperature for storing medicine M,- needs 
to be kept constant and must lie between 2) and 2^. The problem is to find out the 
minimum number of refrigerators needed to store all medicines. 

An interval graph G is constructed in the following manner: for each vertex Mi, 
i G [l..n] there is a corresponding interval /,■ = [2), 2y]. Now, if we consider some 
temperature 7’, all intervals which pass through T have edges with each other(i.e, they 
form a clique) and the corresponding medicines can be kept in the same refrigerator. 
Thus, to solve the given problem, we need to find out the minimum number of cliques 
partitioning the vertex set or, the minimum clique cover has to be computed in G. 



1.2 Computation models 


The models of parallel computation on which the algorithms in this thesis have been 
designed are the Parallel Random Access Machine [PRAM) model and the reconfig- 
urable mesh model. 

A reconfigurable mesh of size rn x n consists of an m x n array of processors 
connected by a reconfigurable bus system. Each processor has four ports besides its 
arithmetic and logic unit. By adjusting the local connections between the ports of a 
processor, the bus system can be dynamically configured. Communication is carried 
out between processors by writing or broadcasting values on the bus. This model is 
more thoroughly defined in Chapter 2. 

In the PRAM model, there are a finite number of processors (RAM’s) operating 
synchronously on an infinite global(shared) memory. It is assumed that each processor 
has an unique index which it knows. All processors execute the same program; in 
each step, a processor may carry out local computation, or it may access the global 
memory. 

Depending on the nature of access to the memory, this model can be further 
divided into subclasses. The specific subclasses referred to in this thesis are: 

Exclusive Read, Exclusive Write (EREIV): No two processors can access the same 
memory location in a single unit of time. 

Concurrent Read, Exclusive Write {CREW): Multiple processors can access the 
same memory location simultaneously for reading, but exclusivity is maintained while 
writing. 

Concurrent Read, Concurrent Write (CRCW): Multiple processors can read from 
as well as write into the same memory location simultaneously. If concurrent write 
is allowed, there needs to be some conflict-resolution rules to determine the unique 
value contained in a memory location where multiple processors have tried to write 
simultaneously. The further subclasses of the CRCW model referred to in this thesis 


are: 



ARBITRARY: As the name denotes, some arbitrary processor succeeds when 
multiple processors are trying to simultaneously write different values to the same 
location. 

TOLERANT: If multiple processors try to simultaneously write to the same loca- 
tion, the contents of the location do not change. 

COMMON: All processors trying to simultaneously write to the same location 
must attempt to write the same value which then gets stored. 

PRIORITY: Among the processors trying to simultaneously write to the same 
location, the least numbered processor succeeds. 

1.3 Overview of thesis 

In Chapter 2, constant time solutions are obtained for some problems in interval 
graphs on the reconfigurable mesh model of computation. In Chapter 3, the problem 
of computing the most vital edge of a graph is considered and an 0(log m) time, m 
processor solution is described on the CREW PRAM. In Chapter 4, the problems 
of parallel construction of a DFS tree and a BFS tree for a permutation graph are 
considered. The construction of a BFS tree takes O(logn) optimal time using 
processors on an EREW PRAM while the construction of a UFS tree takes (9(log® n) 
time with n processors on a CREW PRAM. In Chapter 5, fast optimal solutions are 
described for computing the connected and biconnected components of a permuta- 
tion graph. These solutions take O(logloglogn) time with ipgiogiog" processors on a 
COMMON (or TOLERANT) CRCW PRAM. Some conclusions are finally offered in 
Chapter 6. 



Chapter 2 


Constant Time Algorithms for 
Interval graphs on Reconfigurable 
mesh 


2.1 Introduction 

Interval graphs model a large number of real-life problems in scheduling and VLSI; for 
e.g, the intervals can be job-starting times in a job-schcduling problem or positions of 
components in a VLSI routing problem. It is therefore important to design parallel 
algorithms to obtain fast solutions for problems on these graphs. In this chapter, 
we look at the techniques needed to solve these problems on processor arrays with 
reconfigurable bus systems or reconfigurable mesh. It is possible to develop constant 
time, i.e 0(1) algorithms, on this model while there are very few non-trivial problems 
which have a constant time solution on the CRCW, the most powerful PRAM model. 

The reconfigurable mesh is a powerful model of computation- the 2-d mesh has 
been shown to be at least as powerful as the CRCW model of parallel computation 
[33]. In fact, 0(1) time algorithms are known for some problems (on reconfigurable 
mesh) for which a lower bound of on time is known on CRCW model 

(with polynomial processors); these include the problems of computing transitive 
closure of an undirected graph which can be done in 0(1) time on a 3-d mesh with 
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n X n X n processors, or on a 2-d incsh with x processors [32]; n numbers can 
be sorted in 0(1) time on a 2-d mesh using n x 7i processors [19]; these results imply 
0(1) time algorithms for various graph problems like bipartite graph recognition and 
computing biconnected components within the same resource bounds. An important 
aspect of this model is the fact that a VLSI chip, YUPPIE [21] has been developed to 
demonstrate the concepts of the polymorphic-torus network which is in functionally 
the same class as the model we use. The reconfiguration is achieved at a fine-grained 
level by circuit-switching- ports connected together are literally ‘shorted’. 

In this chapter we consider the problem of obtaining 0(1) time implementations 
of some interval graph algorithms on the 2-d mesh. Specifically, we consider the maxi- 
mum independent set, minimum clique cover and minimum dominating set problems; 
the algorithms for each of these problems take O(logn) time with n processors on 
an EREW PRAM. As in general, intervals need not be sorted, the set of intervals 
provided need to be sorted, which leads to these resource bounds. We obtain imple- 
mentations using 0{n^) processors and constant time, which matches the resource 
bounds for sorting. 

This chapter is organized as follows: in Section 2.2, the reconfigurable mesh model 
is described. In Section 2.3 we present solutions to some problems on the reconfig- 
urable mesh. In Section 2.4 we show how these Solutions can be applied to solve the 
interval graph problems. Finally, in Section 2.5 we conclude with some final remarks. 


2.2 Reconfigurable array in 2-d 

In this chapter, we use the “standard” reconfigurable mesh proposed by Miller et.al. 
[25]. A mesh of size m x n consists of m x n array of processors connected by a 
reconfigurable bus system. Each processor has the capability of performing basic 
arithmetic and logical operations and it is assumed that each such basic operation 
can be performed in one time unit, i.e., a comparison or an addition involving two 
numbers would take one unit of time. The processors are identified by their positions 
on the grid, i.e. PE{i,j), 1 < i < m, 1 < j < n, refers to the processor situated 



at the intersection of the ith row and the _/th column; each processor is aware of its 
identity. 

Each processor has four ports denoted as I/, D,L,7i (i-e., up, down, left, right 
respectively) besides its arithmetic and logic unit. These ports are the actual con- 
nection between the processor and the reconfigurable bus system; by adjusting the 
local connections between the ports of a processor, the bus system configuration can 
be changed dynamically (or reconfigured); for example, by connecting the U and D 
ports of each processor internally we obtain vertical buses; if we now disconnect some 
of the connections, we can split a vertical bus into subbuses. 

Communication is carried out between processors by writing or broadcasting val- 
ues on the bus (each bus is assumed to be capable of carrying O(logn) bits of data, 
where n refers to the size of the problem). Each broadcast is assumed to take one 
unit of time; further in a single unit of time, only one processor is allowed to write 
or broadcast on a subbus shared by multiple processors to avoid write conflicts. In 
case more than one processor attempts to broadcast, the final value on the bus is 
indeterminate. 


2.3 Basic Routines for Reconfigurable Mesh 

Some of the important routines we will be using in this chapter are 

• Minimum Minimum of n numbers can be computed in 0(1) time on a n x n 
mesh [24]. 

• Sorting On a n x n mesh, n elements can be sorted in 0(1) time [19]. 

• Nearest 1. Given a sequence of O’s and I’s, for each 0 it is required to find the 
nearest 1 to its left. The problem can be solved on a one-dimensional recon- 
figurable mesh (first row of 2-dimensional mesh) using bus-splitting technique 
[24]. We construct a horizontal bus and then split this bus to the left of each 
1. Now if each processor which contains a 1 broadcasts its identity on the bus 
to its right, the problem is solved. 



• Prerix-minima(maxinia). 'I'lic problem of computing Prefix-ininima( maxima) 
of n numbers, is considered in Subsection 2.3.2. 

• Mark all ancestors. Given a (rooted) forest, and a distinguished node u, the 
problem is to mark all ancestors of u, i.e., to mark all nodes from v to the root of 
the tree containing v. Solution to this problem is described in Subsection 2.3.1. 

2.3.1 Marking Ancestors 

We are given n nodes each of which contains a variable link referring to the identity 
of its parent in the forest (successor in linked list). We are also given a tlistinguishcd 
node V, and it is required to mark all ancestors of v in the forest, (its successors in 
linked list), i.e. we have to mark all elements lying between v, the chosen element 
and the root of the tree to which v belongs. This is a modification of the 0(1) time 
algorithm for connected component problem which requires a n x n x n or x 
mesh[32]; we will be using an n x n rnesh. We assume that input is in the topmost 
row, i.e. the processors PE{l,i) contain the input (the link variables). 

By connecting L and R ports of each processor, we will get a horizontal bus in 
each row; if j is an ancestor (successor) of element i, we connect rows i and j by 
means of a vertical bus in column j. Therefore, if link{i) = j and link{j) = k^ we 
see that a data path exists between elements i and k. Proceeding inductively, it can 
be easily shown that a data path exists between an element and all its ancestors in 
the tree; further the data buses of different trees arc disjoint. To ensure that proper 
descendants of v are disconnected, we disconnect all vertical U and D links of row 
containing v, except from v to its parent. 

Less informally the algorithm is; 

Step 1 Let us assume that the input is in the I’st row. Each processor PE{i,i), 

1 < i < n sets up the local connection {U,L}. In each row i, processors PE{i,j), 

1 j < i, set up the connection {L,R} and processors PE{i,j), i < j < n, set 
up the connection {U,D}. The net effect is to set up L-shaped buses. 

Step 2 Each processor PE{l,i) transmits its link value to processor PE{i,l) using 
the bus connection set up in Step 1. 



step 3 The buses are disconnected and straight horizontal buses are formed by con- 
necting the {L,R} ports of each processor. 

Step 4 Each processor PE{i,l) transmits the value it received in Step 1 to all the 
processors in its row. Thus, for an element initially present in the i’th proces- 
sor(of the first row) all n processors of the i’th row know its successor after 
execution of this step. 

Step 5 Each processor sets up the connection now there are n horizontal 

and n vertical buses. 

Step 6 Each processor PE{i,j) where j is the successor of i in the linked list sets up 
the local connection {U,L}, thus connecting the i’th horizontal bus to the j’th 
vertical bus. Also, each processor PE{i,i) sets up the local connection {U^L} 
to make a path between horizontal buses i and j . 

In this step, the elements in a linked list are connected in such a manner that 
if any one of them broadcasts some value, all elements in the list receive it. We 
are in effect computing transitive closure: if there is a data path i — » j and 
link(7) = k, then there is also data path i —*■ k. 

Step 7 Processor PE{k, h) where h is the start point in the linked list, disconnects 
its {U,L} connection. Then it transmits any value. All processors which receive 
this value are our desired successor elements. 

2.3.2 Prefix Minima 

We will use an n x n mesh to carry out the computation. Initially, n elements are in 
n processors of the topmost row. For each i, we will try to identify elements whose 
value is less than the value of element i and which also occur before i in the list; the 
minimum of these elements is the prefix-minima for element i. We can sort the items 
to get rank of all elements (rank of i is the number of elements less than i); we will 
assume that sorted items are again in topmost row. We next send the elements in 
topmost row down the column; as a result, contents of each row are identical to that 
of the topmost row. Items to the left of item i are less than item i. If we mark items, 
now left of i, which were originally left of z, and find the first (leftmost) marked item. 



Vi 


we are done. 

In more detail, 

Step 1 We sort given elements on key (val,pos) where val is the value of the element 
and pos is the initial position of the element in the unsorted list [19], such that 
processors in first row contain elements in sorted order. 

Step 2 Each processor PE{i,i), 1 < i < n sets up the local connection {f/,L}. In 
each row i, processors PE{i,j)^ 1 < j < set up the connection {L,R} and 
processors PE{i,j), i < j < n, set up the connection {U,D}. The net effect is 
to set up L-shaped buses. 

Step 3 Each processor PE{l,i) transmits pos to PE{i, 1) using the bus connection 
set up as above. 

Step 4 Each processor disconnects its previous connection and sets up the connection 
{L,R} forming horizontal buses. 

Step 5 Each processor PE{i,l) transmits pos to all the processors PE{i,j) in its 
row. At the end of this step, all the processors in the i’th row know the initial 
position of the element whose sorted rank is i. 

Step 6 Each processor disconnects its {L,R] connection and sets up its {17, D) con- 
nection forming vertical buses. 

Step 7 Each processor transmits pos to all the processors in its column; let 

this new value of pos received by the processors be denoted as pos'. 

Step 8 Each processor PE{i,j) compares pos with pos'. If pos < pos' and j < i, it 
sets a boolean flag to 1, else to 0. Rffcclivc'ly, in colunin i, llic oh'iiKnits which 
have a sorted rank less than i and their unsorted position is also less than i 
have th(!ir flags set to 1; the remaining elements have their flags set to 0. 

Step 9 In each column we find out the first occurrence of a 1. For this, we use 
bus-splitting in a similar fashion as in the algorithm of Nearest 1. The vertical 
buses are split at each 1 and the processors with I’s transmit their identity to 
the upper subbus through their U port. The value received in PE{l,i) is the 
prefix minima corresponding to element i. 
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2.4 Interval graphs 

2.4.1 Definitions 

Given a family J = {/.•,/ 2 , .../«} of intervals, after Olariu (27], we define first{X), 
right{Ii) and next{Ii). 

1. The parameter first{T) is the interval in family X which ends first : 
first{J) = Ij if b} = min{6,- | 1 < i < n} 

2. For (each) interval /<, we consider Jj, the set of intervals which overlap with 
/,•; right{Ii) is the interval of Ji which ends last. If there is no interval which 
overlaps with then the set is empty, and right{I{) is not defined. 

Thus if bj = max{6fc | ajt < 6,- < bje} is defined, right{Ii) = Ij, otherwise 
right{Ii) = nil. 

3. For (each) interval /,-, we consider X,-, the set of intervals which begin after the 
right end-point of /,•; next(Ii) is the interval of Zi which ends earliest. If there 
is no interval which starts after /,• ends, then the set Zi is empty, and next{Ii) 
is not defined. 

Thus, if bj — min{6jt | bi < a*} is defined, next{I{) = Ij, else next{Ii) = nil. 

It has been shown in Olariu [27] that computation of right{Ii) (for all /,) involves 
sorting and prefix-maximum; computation of next{li) (for all /{) involves sorting and 
suffix-minima. Thus, both the above functions can be computed on 2-d reconfigurable 
mesh in 0(1) time using only n x n processors. Also, computation of first{I) only 
involves computation of minimum of n numbers, and hence can be done within the 
same bounds 

2.4.2 Algorithms for interval graphs 
Maximum independent set 

A subset S of vertices in an interval graph is an independent set if the intervals they 
correspond to are mutually non-overlapping. The Maximum Independent 5e<(MlS) 
problem is to compute an independent set having maximum cardinality. Golumbic 
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[14] hais shown that there exists a MIS 5 in Z such that first{2) € S and for any 
It € 5, nexi{Ii) € S. 

Thus, if a forest is built such that each element /f points to next{Ii), the MIS 
consists of exactly those elements which are marked while traversing from first{T) 
to the root of the tree it belongs to. From the discussion in Section 3.1, the necessary 
computation can be carried out in 0(1) time on a n x n mesh. 

Minimum dominating set 

A subset S of vertices in a graph G = {V,E) is a dominating set if each vertex of 
V — S (outside subset S) is adjacent to some vertex of the subset S. The Minimum 
Dominating 5et(MDS) problem is to compute a dominating set having minimum 
cardinality. 

The algorithm as in [27] involves construction of a forest where each element 
points to right{next(Ii)){\I it exists) or to next{Ii) otherwise. The MDS consists of 
exactly those elements which are marked while traversing from right{first{X)) (if it 
exists) or fir$t{X) otherwise, to the root of the tree it belongs to. Again, as evident 
from previous discussions, the necessary computation can be carried out in 0(1) time 
on a n X n mesh. 


Minimum clique cover 

For a graph G = {V^E) clique cover' is a partition of vertices V of graph G into 
non-empty disjoint sets such that the vertices belonging to each set form a clique. 
Minimum clique cover (MCC) is a Clique Cover having smallest number of partitions. 

In an interval graph, the cardinality of the MCC is the same as the cardinality 
of the MIS and also the MCC can be easily obtained by a simple Nearest 1 compu- 
tation once the MIS is available, as shown in [27]. Thus, this algorithm can also be 
implemented in 0(1) time on a n x n reconfigurable mesh. 
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2.5 Conclusion 

In this chapter we considered fast implementations of interval graph algorithms on the 
2-d reconfigurable mesh. Since the recon figtirable mesh looks in some ways similar 
to a mesh of trees [26], it would be natural to ask about the time and processor 
complexities of these algorithms on this model. Let us consider the main blocks 
as laid out: prefix minima computation can be carried out in O(logn) time using 
n processors and sorting on a n x n mesh of trees takes O(logn) time. Here, the 
bottleneck operation is the marking of the elements in a linked list which can not 
be done in even o(log^ n) time on a n x n mesh of trees. However there is some 
evidence that it is extremely unlikely that there is an o(log’ n) time using polynomial 
processors [30]. Another open problem is to carry out list-ranking on a 2-d n x ri 
reconfigurable mesh. 



Chapter 3 


The most vital edge problem 


3.1 Introduction 

In many network applications, a problem of interest to the network designer is that 
of finding the edges of the network which are most important- if these edges are 
removed from the network, the performance of the network will be greatly affected. 
A most vital edge is an edge, which if removed, causes the maximum change(increase) 
in the cost of the minimum spanning tree {MST{G)) of the graph G. More formally 
it may be defined as follows: let G = {V,E) ho & weighted undirected graph with n 
vertices and m edges; each edge e has a weight w{e) assigned to it. Let f{G) be the 
weight of a minimum spanning tree of G if G is connected; otherwise f{G) = oo. The 
most vital edge is an edge e such that f{G — e)> f{G - e') for every other edge e' of 
G. The model of parallel computation used in this chapter is the CREW(Concurrent 
Read, Exclusive Write) PRAM. 

The organization of this chapter is as follows : in Section 3.2 some definitions are 
briefly reviewed. In Section 3.3 we describe an algorithm which runs on the CREW 
model in O(logn) time using processors; and in Section 3.4 an O(log7n) time, 
m processor algorithm is described. 
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3.2 Preliminaries 

We may assume that edge-conncctivity of G is more than or equal to two as otherwise 
a bridge can be easily located and reported as the most vital edge. We can further 
assume without loss of generality that the edge weights are distinct to ensure that G 
has an unique minimum spanning tree; otherwise we may order the edges in order of 
occurrence. 

If e is an edge of MST{G), and G — e is connected the i-eplacement edge r(c) [15] 
is defined to be the edge such that MST{G) — e + 7 '(e) is a minimum spanning tree 
of G — e. If G — e is not connected r(e) is undefined. As the most vital edge of G 
belongs to MST{G) [15], for a bridgeless graph G the most vital edge is that edge 
which maximizes u?{r(e)) — w{e) among e G MST{G). 

In this chapter, tree edges refer to edges belonging to MST{G), and non-tree the 
remaining edges in the graph. It will also be assumed that the minimum spanning tree 
MST{G) is available. Minimum spanning tree can be computed on the EREW(and 
hence CREW) models in O(lognloglogn) using n + m processors [ 20 ] and on the 
PRIORITY CRCW model in (9(logm) time using processors [11, 17]. 

Plane sweep tree[5, 2] 2’ can be defined as follows. Let S = {si,S 2 , ...,Sn} be a 
set of non-intersecting line segments in the plane and T be a complete binary tree 
with n -f- 1 leaves. The leaves of T, in left to right order correspond to intervals 
[(— 00 , xi ),(«!, 12 ) 5 •••(^^nj+cxD)]. Associatcd with each internal node u 6 T is an 
interval /„ which is the union of intervals associated with descendants of v. Let 7 r„ 
denote the vertical strip /„ x (— 00 , -(- 00 ). A segment s covers a node t; if s spans 
7 r„ but not TTparenttv)- For each node u G T let Cover{v) denote the set of segments 
that cover v. We will use the term Plane Sweep tree to refer to T together with the 
lists Cover{v) stored at each node. Atallah et. al ( Theorem 5.2 [5]) have shown that 
the plane sweep tree T can be constructed in 0(log n) time using n processors on the 
CREW model. At the end of the construction procedure, each node u in T has its 
list Cover{v) sorted by the “above” relation. 

Given a directed graph G = in which each node v contains a sorted list 
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C{v), the fractional cascading problem (10, 5] is to construct a data structure such 
that given a walk(a sequence of adjacent vertices) {vi,V 2 ,...,Vn) in G and an arbi- 
trary element a:, a single processor can quickly locate x in each C7(u,). The term 
multilocation refers to this procedure of locating x in multiple lists. 

If d(G) is the maximum degree(indegree/outdegree) of any vertex in G and Out{v) 
is the set of all nodes w eV such that {v,w) € E, Atallah et. al {Theorem 3.10 [5]) 
have shown that when d{G) is 0(1) or Out{v) is given in sorted order for each V, 
a fractional cascading data structure can be built for G in O(logn) time using 
processors on the CREW model; here n = |V| H- |iJ| + \C{v)\. 

3.3 Optimal algorithm for dense graphs 

In this section we describe a simple O(logn) time algorithm using processors. 
The time-processor product matches that of the sequential O(n^) algorithm of Hsu. 
et al [15]; these algorithms are optimal for dense graphs. 

The input is in an n x n matrix W; if there is a non-tree edge e G G of weight 
w{e) bctwccp vertices t, j then = H^[i, i] = u}{e), else W[i,j] = oo. 

Step 1 We build an auxiliary matrix C as follows: 

Step la Using processors initialize C[iJ] = oo. 

Step lb For each leaf vertex i pardo 
For all vertices j do 

C(i,j) = vni.j) 

Step 2 For each non-leaf vertex i pardo 
For each vertex j do 

Recursively define C{i,j) = min{l'V'(i,y), C(ii, j), •••} 

where * 1 ,^ 2 , ••• children of i. 

Parallel tree contraction technique [1] can be used to compute 
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Step 3 For each tree edge e from i to its parent p{i) in MST{G), let j* be such 
that = min{C(f, j); j is not in the subtree rooted at i }. Define r(e) = 

{i,n 

step 4 Find the edge c € MST{G) for which M;(r(c)) - w{e) is maximum. 

Lemma 3.1 The above algorithm correctly computes r(e) in O(logn) time with 
processors. 

Proof; The correctness of this algorithm is obvious from [15]. To obtain the time 
complexity, we note that Step 2 for a single (fixed) vertex j can be easily done in 
O(logn) time with processors [1]; thus step 2 takes O(logn) time with x 
n = processors. In Step 3, we assign n processors to each row of C7 - each 
processor checks in 0(1) time if it is in the subtree rooted at i . All processors which 
find that they do not match the required condition “remove” themselves and the 
remaining proces.sors proceed to coni|)utc the minimum. Clearly, this step can also 
be implemented in O(logn) time with processors. I 


3.4 An m-processor algorithm 

Though the algorithm in the previous section is optimal for dense graphs, for most 
applications it is preferable to have an algorithm whose cost is a function of number 
of edges in the graph, specially if the graph is sparse. In this section, we describe an 
O(logm) time algorithm using m processors. 

We first compute the pre-order number for all vertices with respect to MST{G); 
this can be done by Euler tour traversal technique [31] in O(logn) time with pro- 
cessors. Using the Euler tour, the number of descendants of each vertex in MST{G) 
can also be computed within the same resource bounds. We let pre(u) denote the 
pre-order number for vertex v and des{v) the number of descendants of vertex v, 
respectively . 

To compute the most vital edge, essentially, for each tree edge e, we need to 
compute its replacement edge v(e). Let us consider a tree edge e = (a,&) whore 
pre(a) < pre(b). As one end-point of r(e) is necessarily a descendant of b while the 
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other enci-point is a non-descendant, the rej)laccnieat edge r(e) is the minimum cost 
edge in tlio set of edges having one end-vertex in subtree rooted at b and the other 
end-vertex is not in the sub-tree rooted at b. 

We represent each non-tree edge {a,b), pre(a) < pre(b) as a horizontal segment 
in the plane with ^-coordinate ranging from a to 6, i.e segment {(pre(a),pre(b))}. 
The j/-caordinate of the segnunit is defined as follows: if there are k segments having 
first x-coordinate o, the f’th segtnent is assigned a y-coordinate = a -h The 
cost of each non- tree edge is assigned to its corresponding segment. Note that by 
definition, all the segments are strictly ordered on their y-coordinate. 

Now, any descendant n,- of a given vertex v, has a pre-order number between 
pre(v) and a*, = pre(v) -f des(v), i.e pre(v) < pre(vi) < exv- An edge starting from 
a descendant of v is represented by a segment whose first x-coordinate x' fulfills the 
condition pre(v) < x' < and an edge ending at a descendant of v is represented 
by a segment whose second x-coordinate x" lies between pre(v) and i.e pre{v) < 
x" < o:„. 

Thus, we can solve the problem if we can find for each vertex v the least cost 
segment among those which satisfy any one of the following conditions: 

1. The segment starts at ji, (pre(u) < ji < pre{v) + des{v)) and ends at j 2 , where 

> pre{v) -1- des{v) + 1. 

2. The segment starts at jf}, (1 < j[ < prc(u)) and terminates at {pre{v) < 
32 < pre(u) -f des(u)). 

The two cases are similar(they are in fact equivalent : case (2) viewed from the 
right is exactly same as case (1)). In subsequent discussions we will only consider 
case (1). 

As the set of segments is strictly ordered, a plane-sweep tree[5] T can be con- 
structed for this set. For each node u in T we consider Cover{v), the set of all 
segments that cover v. By construction of T the segments belonging to Cover{v) are 
sorted in increasing order of their t/-coordinates. This implies that segments starting 
from different vertices in G are arranged in increasing order of their x-coordi nates 
while segments starting from the same vertex appear consecutively in Cover{v). 

LIBRARY 
I !. T.. KANPUR 
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Now, for vcrtox v lot ho the leaf in T associated with the integer a„ + 1. By the 
property of plan<!-swo<‘p tree if wc consider the Icaf-to-root walk {E^, , uj—, root{T)) 

in T, the set C'ouer(£„){JCoyc7*(i;i)UC'ouer(u2)U — UC'ouer(roo<(T’)) contains ex- 
actly those segments that start before the vertical line /„ = a„ -h 1 and end after 
Thus our desired query is to locate the least-cost segment which belongs to this set 
and starts after(or at) pre(v). 

We can compute the least-cost segment starting after(or at) pre(y) for a node u,- 
belonging to 7' by a suffix rniniinafon key cost) on the list Cover(v{) followed by a 
binary search to find the first segment whose starting x-coordinate is greater than 
or equal to pre{v). Obviously, due to the suffix-minima cornputation, this segment 
can provide the answer to the desired query. To increase the efficiency (the number 
of elements in the sorted list) of the binary search, we remove from Cover{v) sets 
of segments which have the same starting x-coordinate and retain only the segment 
with minimum cost(after suffix computation) of each set. Naturally, the first segment 
of each set happens to satisfy this condition; so it is sufficient to retain only the first 
segment of each set. 

Thus, for. a single node Vi in the walk, the query can be answered by a single 
processor in O(logm) time, as |(7oucr(y,)| < m, assuming the suffix-computation has 
been performed. As the length of the walk is 0{logm), the overall query can be 
answered in 0(log^ m) time by a single processor. 

To answer the query faster, i.e in O(logm) time, we convert T into a fractional 
cascading data structure T' [5]. The query can be answered in the manner described 
below. In Cover(Ev) we find the first segment that has a x-coordinate greater than 
or equal to pre{v). Since suffix minima has been computed earlier, this gives us the 
minimum cost segment among those belonging to Cover(Ey) and starting after(or 
at) pre{v). We then perform multilocation along the leaf-to-root walk, i.e from Ey to 
root{V). From the property of T and T', only 0(1) time needs to be spent for each 
node along this walk. Since the length of the walk is 0(log m), answering each query 
needs O(logm) time. 

Theorem 3.1 The above algorithm correctly computes r(e) in O(logm) time with 
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m processors. 

Proof: 'Flic correct«cs.s of the algorithm follows from its construction. T can be 
constructed in O(logm) time with m processors [5]. For the suffix-minima step, since 
there are 0(m log rn) elements overall in T, this step again needs O(logm) time with 
m processors. T' is also constructible within the same bounds. As we have n queries 
each of which can be answered independently in 0(log7n) time, the query part uses 
n processors and takes O(logTn) time. Thus, the theorem follows. I 

3.5 Final remarks 

In this chapter, O^logrn) time algorithm using rn processors on CREW model has 
been described. However if an efficient algorithm for the maximum spanning tree 
can be designed, then, using an observation of Iwano and Katoh[18] the number of 
non-tree edges to be considered in the algorithm can be reduced to n only and hence 
the number of processors can also be reduced to 0(n). 


t 



Chapter 4 


DFS and BFS in permutation 
graphs 


4.1 Introduction 

Let JT be a permutation of n integers in range [l..n]. For this permutation, a graph 
G(7r) is defined in which there is a vertex for each integer i and an edge between 
integers i arid j if (* — j)(7r~*(*) — ir~^(j)) < 0 where denotes the positidn of 

integer i in ir. An undirected graph G = (V, £) is called a permutation graph if there 
exists a x such that G is isomorphic to G(7r) [14]. Thus, for a permutation graph we 
use the terms “vertex” t and “integer” t interchangeably in rest of the chapter. 

A DFS tree for a (general) graph G = (V,I!) is a spanning tree such that if any 
two vertices have an edge in the graph, one of them is an ancestor of the other (in 
this tree). A BFS tree is a spanning tree such that if any two vertices have an edge in 
the graph, their levels in the tree do not differ by more than 1. For (general) graphs 
G = (F, E), the problems of constructing BFS and DFS trees have simple 0(n + m) 
time sequential solutions, where n =| F | and m =| F |. The problem of efficient 
(optimal) parallel construction of a BFS or a DFS tree for a general graph is open. 

In this chapter, we consider the problems of finding a DFS tree and a BFS tree 
of a permutation graph provided the permutation tt is given. Under this eissumption, 
the best sequential algorithms for constructing a DFS takes O(min(nloglogn,n -\- 
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m)) and for BFS tree 0{n) time [29]. The proposed parallel solution for the DFS 
problem is based on finding longest increasing subsequence in parallel. This problem 
has a 0(n log log n) time sequential solution, but the best parallel solution requires 
O(log^n) time witii n processors on a Concurrent Head Exclusive Write (CREW) 
PRAM model of computation [23] and processors on an ARBITRARY- 

WRITE Concurrent Read Concurrent Write (CRCW) PRAM [28]. On a CREW 
PRAM, more than one processor can simultaneously read from the same memory 
location, but concurrent writes are not allowed. An ARBITRARY-WRITE CRCW 
PRAM model allows concurrent writes with the stipulation that in case of multiple 
writes in a single memory location, an arbitrary processor succeeds. The solution 
for the BFS problem takes O(logn) time using processors on an Exclusive Read 
Exclusive Write (EREW) PRAM. In this model, no two processors can simultaneously 
access the same memory location either for reading or for writing. 

The organisation of this chapter is as follows: in Section 4.2 we consider the 
problem of constructing a DFS tree and in Section 4.3 the problem of constructing a 
BFS tree. 

i 

\ 

4.2 DFS tree 

The proposed parallel algorithm is obtained by parallelizing a slightly modified version 
of sequential algorithm of Rhee et.al. [29]. The organisation of this section is as 
follows: first some preliminary definitions and results are discussed, the algorithm is 
then described and finally resource (processor and time) requirements of the algorithm 
are analysed. 

Preliminaries 

A minimum clique cover (MCC) of a graph is a partition of vertices of the graph 
into a set of cliques, such that the number of cliques is minimum. Since permutation 
graphs are a subclass of the family of perfect graphs, the cardinality of the MCC is 
the same as the cardinality of the maximum independent set (MIS) [14]. 
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it is well'known thut in a permutation graph, a clique is a decreasing sequence of 
integers and an independent set is an increasing sequence of integers (see e.g. [14]). 
A MCC, CC{v) = {Cl, C3, . . . ,C,) where q =j AfCC(5r) | of a permutation graph can 
be obtained in following inanner{29]: to obtain all elements Cj belonging to clique 
Ci, ail elements belonging to all Cj, 1 < j < (i — 1) axe removed from t; the first 
element left in jt is the first element (C/) of C< and subsequently each element C{ in 
Ci is the first element (going from left to right) less than C{~^ in reduced tt. As is 
obvious, each C,- is a maximal clique and the Ci’s constitute a clique cover. 

Thus, the vertices of each clique C,- are arranged in descending order; if l{i) denotes 
the leftmost (largest) element in Ci and r(t) denotes the rightmost (smallest) element 
in C,-, the C.’s are indexed such that the r(*)’s are arranged in ascending order. 

For clique C;, we consider all cliques Cj, {j < i) where Cj contains a vertex k such 
that k < l(i) and k has an edge with /(i) or, occurs after l{i) in x. The clique C/(i) 
is defined to be the clique with maximum index satisfying this condition, thus if 
f* = max[t I Ci contains k^k < l{i) and 7r“*(A:) > x“^(/(i))] then C/(z) = C«*. The 
element l-adj{i) is the largest value in Ci(i) which satisfies the given condition, i.e, if 
k* — max[A;,j k € C/(i) and k < l(i) and 7r"*(fc) > then l-adj{i) = k*. 

Next, for the clique C,- we consider all cliques Cj, {j < i) where Cj contains a 
vertex k such that k > r(i) and k has an edge with r(i) or, occurs before r(i) in 
X. The clique Cr(t) is defined to be the clique with maximum index satisfying this 
condition, i.e, if 

r = max[t I Ct contains k,k > r{i) and x"^(A:) < x“^(r(i))] then Cr(i) = C*. The 
element r-adj{i) is the smallest value in Cr(i) which satisfies the given condition, i.e 
if 

k* = min[fc | k € Cr(*) and k > r{i) and x“*(fc) < ^”^^(0)1 r-adj{i) = k*. 

Algorithm 

We next describe the algorithm for computing a DFS tree. All elements of a clique 
axe to be oriented in the same direction to avoid cross-edges. So, in some sense eadi 
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clique tries to choose the clique nearest to it and the elements set up their parent 
pointers accordingly. 

Step 1 Compute CC{v) = where the CiS are indexed as discussed 

earlier. 

REMARK: In this step, the minimum clique cover is computed and the elements 
in the cliques are arranged as defined earlier. 

Step 2 For each clique C,- compute Ci{i), l-adj{i), Cr{i), r-adj{i). Define Cg{i) to be 
the clique with larger index among Ct{i) and Crii). 

REMARK: In this step, each clique C,- tries to choose the clique Ct{i) nearest to 
it. Intuitively, consider cliques Cj, C*,, with j,k < i; further assume that k < j. 
Now, if the elements in C,- have edges with elements in both Cj and C*, while 
elements in Cj also have edges with elements in Cfc, then C,- chooses Cj and Cj 
chooses Ck as the nearest clique to it so that we can have the DFS traversal 
with Ck = parent(Cj) and Cj — parent(C,). 

Step 3 We define the parent p(C'i ) of each element Cf in the DFS tree as follows: 
p(C'j) = /* First vertex of first clique is the root */ 

For each element j in Ci do p(Cf) = Ci~^. 

j* In first clique, parent of each element is the previous element. This is 
subsequently referred to as the backward direction. The forward direction 
correspondingly implies that the parent of each element is the next element. */ 
For each element j in Ci do j* Find parent of vertices in i’th clique */ 
if l-ad3{t) € C,{i) then /* i.e, if C,{i) = ^(0 */ 

p{0/) = Ci~^. /* Ci is oriented in backward direction */ 
ifp(C'J(i)) = C]{i) then /*Nearest clique is oriented in forward direction */ 
p{l{i)) = l-adj{i). 

else /* Nearest clique is oriented in backward direction */ 
p{l{i)) = r(s). 

/* The parent of l(i) is the last element of the nearest clique */ 

else /* Cs(i) = Cr(i) */ 

p(Cf) = C'/'*’*. /* Ci is oriented in forward direction */ 



if p(Cg(i)) — C^(i) then /*Nearest clique is oriented in forward direction */ 
p(r(i)) = /(s). 

I* The parent of r(i) is the first element of the nearest clique Cs{i). */ 
else /* Nearest clirpie is oriented in backward direction */ 
p(r(i)) = r-odj(i). 

Initially, the elements of Ci are laid out in a straight chain with the maximum 
element (in value) as the head of the chain. Obviously, this is a DFS traversal for 
elements belonging to Cj. We refer to such a chain as a backward chain and a 
chain in the reverse direction as a forward chain. Subsequently, members of clique 
Ci check whether its nearest clique is oriented in the forward or backward direction. 
The condition p(/{C',(i))) € C,{i) is true implies that the nearest clique is oriented 
forward, otherwise, it is oriented in the backward direction. Once the orientation of 
the nearest clique is known, the exact relationship between Ci and C,(i) is checked. 

For example, let C,(t) be oriented forward and l-adj{i) € l{i) has an edge 

with all elements belonging to C,(i) and less than This l{i) cannot be hooked 

to any element k less than l-adj{i), as it would result in cross-edges (/(i) has an edge 
with i-adj(i) in the graph but is neither an ancestor nor a child of l-adj{i) in the DFS 
tree). Thus /(i) is hooked to l-adj{i). 

Lemma 4.1 The above algorithm correctly computes the DFS tree for the given 
graph. 

Proof: See (29]. ^ 

Lemma 4.2 Step 1 can be implemented in 0(log^ n) time with n processors on a 
CREW PRAM or in O(log^n) time with processors on an ARBITRARY 

write CRCW PRAM. 

Proof: Computing the maximum cardinality independent set (MIS) in a permutation 
graph is equivalent to computing the longest increasing subsequence (LIS) of x and 
can be done by finding the longest common subsequence (LCS) of two strings, one of 
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Fig 1 Fig 2 


them t and the other tlic sequence of the n integers 1 . . .n. This computation can 
be performed by Lu’s algorithm (23, 28] within the stated resource bounds. 

In the LIS computation of Lu’s algorithm, the doss of each point is also computed. 
A point (an integer) p belongs to a doss i if the longest increasing subsequence in 
ir[l,.T~‘{p)] is of length ». If elements belonging to all classes j, (1 < j < i) are 
removed, then the length of longest increasing subsequence (in reduced jt) for each 
element belonging to class t is 1, i.e the elements belonging to class i form a decreasing 
subsequence in k. Hence, the classes partition the integers in tt into decreasing 
subsequences; as eacli class is a decreasing subsequence in tt it is a clique in the 
graph. Also, the number of classes is equal to the cardinality of the MIS. Thus, the 
classes form a minimum digue cover. Observe that the value of r(i), the least integer 
belonging to class i, is more than the value of r(J) for class j, (j < i); otherwise, the 
element r(i) would occur in some class less than or equal to j. Thus, the i’th class 
is exactly O',- as defined. In Fig 1, corresponding to some permutation x the classes 
(cliques) computed by Lu’s algorithm are shown. 

From the above discussion, the resource bounds for obtaining the minimal clique 
cover C'C'(x) required in the algorithm are the same as those used in algorithm for 
finding an MIS. * 

Lemma 4.3 Step 2 can be implemented on an EREW PRAM in O(log n) time using 
j~ processors. 
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Proof: We discuss algoriihin for computation of l-adj{i)\ computation for r-adj{i) 
IS similar. An array iV is constructed in following manner: the positions of elements 
in Ci occur in successive locations of IV'’ in ascending order, (r(i) occurs first and l{i) 
last). The location just before the location for element r{i) in W contains another 
copy of the value of /(*). 'rhu.s. if | C, | = x, C,- occupies x + 1 locations in W. Also, 
the ordering between C,’s is maintained, i.e Cj, j < i occurs before C,- in W. We now 
claim that a nearest larger (8) computation in W solves the problem. 

To prove this, recall that the elements in each clique C{ are arranged in order both 
by value as well as po.sition in jt. Now, if C({i) = Cj, j < i, all elements in all cliques 
Ck ij < k < j) have positions less than l{i). So, none of the /(^:)’s of such cliques can 
be the nearest larger for element l{i). Now, searching from l{i) towards the left in 
W, once we reach C’i(t), we know that the elements are arranged in increasing order 
(as seen from Thus, the first clement that /(i) sees larger than itself is l~adj(i). 
The array W can be constructed optimally in 0(log n) time with linear work on an 
EREW PRAM and a nearest larger computation can also be carried out within the 
same bounds. Once l-adj(i) has been computed, the identity of Ci{i) is also known. 

I 

Lemma 4.4 Step 3 can be implemented on an EREW PRAM in O(logn) time with 
j— processors. 

Proof; It is obvious from tiie algorithm that necessary information for all elements 
cj of a clique Ci can be computed in 0(1) time from 0,(i). We therefore construct 
a tree in which there is a node for each clique O,- and the parent of each 0,- is C,(i). 
So, once the desired assignments are made to all elements of Oi, we just need a top- 
down algebraic tree contraction step (T-ATC) [1] to compute the information for all 
elements present in nodes of tree. This step can therefore be implemented in 0(log n) 
time using processors on an EREW PRAM. ^ 

Theorem 4.1 DFS tree of a connected permutation graph can be constructed in 
0(log®n) time with n processors on a CREW PRAM, or alternatively in 0(log n) 
time with processors on an ARBITRARY CRCW PRAM. 
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Proof: The proof of correctness follows from Lemma 4.1 and the resource bounds 
required for the computation follow from Lemmas 4.2, 4.3 and 4.4. I 


4.3 BFS tree 

Preliminaries 

The matching diagram (14) for a permutation graph consists of two rows of integers. 
The upper row, U is the sequence of all integers in the range [L.n] while the lower 
row D is the given permutation, v. Each integer * in U is connected by a straight line 
to that position j in i) such that Jr()) = i; thus we may view the graph as a bijective 
mapping of integers from U to D\ an edge {i,» is in the graph if and only if the lines 
t and j intersect in the matching diagram. 

In the matching diagram the elements which are tilted towards the right, i.e 
integers k such that jr"*(fc) > k are called jR-elements. Similarly, elements tilted 
towards the left, i.e, integers k, < k are labelled as i-elements. 

Observation 4.1 In a permutation graph, for each iJ-element r there exists at least 
one L-elernent I which intersects it such that I > 7r"‘’{r). 

Proof: Both U and D are permutations of n integers. If there does not exist a 
L-element I which intersects r, this implies that n — r integers in U are mapped to 
n - jr“*(r) < n -r integers in D, which is a contradiction. Again, if / < ir~^(r) for all 
i-elements I intersecting line r, let be maximum among all such /’s. But, n — jr“^(r ) 
integers in D would then map to n - /' > n - x~*(r) integers in U . Thus, there is a 
contradiction. ® 

Observation 4.2 In a connected permutation graph, for each i?-element r such that 
ir-^(r) < n, there exists at least one L-element I intersecting it such that / > tt ^(r). 

Proof: In observation 1 we have proved that there exists I where / > tt ^(r) for any 
i2-element. If f =s x"'*(r), then n — / integers in U map exactly to n — x ^(r) = n — l 
integers in i?, i.e the integers in U in the range f + 1 to n are in a separate connected 
component than the one in which integer I lies; thus there is a contradiction. I 
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b) if < jr“H^ 2 t+i) then p{j) = Pat+i; Uvel{j) = 

2i + 2. 

/* if the first odd-indexed element whose position is less than position 
of j in D is P^i-u tlie parent of j in the BFS tree is Pa+i*/ 

c) else /* I’he first odd-indexed element whose position is less than posi- 

tion of j in D is Ai+i- */ 

pU ) = ievel{j ) = 2t -f- 3. 

/• if this condition holds the parent of j in the BFS tree is Pst+s*/ 

REMARK: Effectively, in this step each element j has to locate its corresponding P^i 
i.e, the first even-indexed element in tlie marked path such that P^i < j- Once this 
element is located, the values of Pji+ij Pa-u etc, which are needed in (b) and (c) are 
also available. This is achieved by marking the values of the even elements (in the 
marked path) in U. Each non-niarked element needs to locate the nearest marked 
element to its left. This can be done in O(logn) time with ^ processors on an 
EREW PRAM. 

For proof of various lemmas below, we refer to the matching diagram of Fig 2. 
Lemma 4,5 For each element i, the parent p{j) is an element with which j has an 
edge in the graph. 

Proof: If j * Pi for some i, then by definition, p{j) has an edge with j. Otherwise, 
we consider p(i) as defined in cases (b) and (c) of the algorithm. If case (b) holds, we 
note that < 3 r”’(p 2 i+i) and j > Pa but Pj,- > Pjt+ij i-ej > Pji+ij or» l^kere 

is an edge between j and p{j) in the original graph. A similar treatment is valid for 

case (c). 

Lemma 4.6 For each element j, the parent pointers p{j) define a path to the root 
of the tree. 

Proof; Obvious from construction. ® 

Lemma 4.7 For any two elements j and k in the graph, ( level{j) - level{k) \< 1. 



level{j) = 


b) if 5 r~*(P 2 i-i) < ^~Kj) < ^(P2,+i) then p(j) = Pai+i; 

2 i + 2. 

/* If the first odd-indexed element whose position is less than position 
of j in D is Pz.-i, the parent of j in the BFS tree is Pa.+i.*/ 

c) else /* The first odd-indexed element whose position is less than posi- 
tion of j in D is pj.+i- */ 

P(j) = ■P2.-+2; level{j) = 2i 3 . 

/* If this condition holds the parent of j in the BFS tree is p2»+2*/ 
REMARK: Effectively, in this step each element j has to locate its corresponding P 2 i 
i.e, the first even-indexed element in the marked path such that P2i < j. Once this 
element is located, the values of P2,+i, P21-1, etc, which are needed in (b) and (c) are 
also available. This is achieved by marking the values of the even elements (in the 
marked path) in U. Each non-marked element needs to locate the nesurest marked 
element to its left. This can be done in 0 (log n) time with processors on an 
EREW PRAM. 

For proof of various lemmas below, we refer to the matching diagram of Fig 2 . 

Lemma 4. '5 For each element y, the parent p{j) is an element with which j has an 
edge in the graph. 

Proof: If j = Pi for some i, then by definition, p{j) has an edge with j. Otherwise, 
we consider p(t) as defined in cases (b) and (c) of the algorithm. If case (b) holds, we 
note that < ?r~^(P2,+i) and j > P2i but P2,- > P2.+i) i-e j > Pzi+i, or, there 

is an edge between 3 and p{j) in the original graph. A similar treatment is valid for 
case (c). I 

Lemma 4.6 For each element j, the parent pointers p{j) define a path to the root 
of the tree. 

Proof: Obvious from construction. I 

Lemma 4.7 For any two elements j and k in the graph, | level{j) — level{k) |< 1. 
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Proof: We consider the set of elements defined by case (b) of the algorithm. This 
set of elements has P2t+i? which is a i?-element, as parent, and we denote this set as 
i?2«+l • 

Note that for any element k € Rn+u T^~^{k) > P2,_i which is a P-element. If 
T < Pzi-i, then instead of P^i-i choosing Pni(P2«-i) = P2«» it would have chosen 
Pm{P2i-\) = fc, i.e there would be a contradiction. 

Thus, the P-sets constitute a partition, or no element in a P-set can have an edge 
with an element in another P-set. 

Similarly, we consider the set of elements defined as case (c) with parent P 2 i +2 
and denote this set as //2i+2 ^s this set is associated with a P-element. For an element 
k' € •£'2i+2) < p2t+3 there would be a contradiction otherwise. The P-sets 

constitute a partition similar to the P-sets. 

We now consider an element j € P2i+i • As already seen, it may have edges only 
with elements k belonging to any of the three sets: (i) P2,+i (ii) Z<2i (iii) f^2t+2- K 
case (i) holds, obviously level{j) = level{k). For cases (ii) and (iii), we recall that 
level{P2i) < level{P2i+i) < level{P2i^2)- So, if case (ii) holds, level{k) = 2 i + 1 , and 
for case (iii), level{k) = 2 i + 3 . But level{j) = 2i + 2. Thus, | level{j) — level{k) |< 1 
for all edges {j, k) in the original graph. I 

Theorem 4.2 The BFS tree of a connected permutation graph can be computed in 
O(logn) time with 5^ processors on an EREW PRAM. 

Proof: The correctness of the algorithm follows from Lemmas 4 . 5 , 4.6 and 4 . 7 . 
Lemmas 4.5 and 4.6 imply that the p(i)’s define a spanning tree and Lemma 4.7 
proves that this tree is indeed a BFST. I 

4.4 Conclusion 

In this chapter, an optimal parallel algorithm for constructing the BFS tree of a 
permutation graph has been described. The algorithm for the DFS tree is constrained 
by the resource requirements for constructing the MCC (MIS) of a permutation graph. 
The sequential complexity of the algorithm for the MCC problem is 0(n log log n). 
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However, as other steps of the DFS algorithm take only O(logn) time using i— 
processors on an EREW PRAM, discovery of a better algorithm for MCC problem 
will result in corresponding improvement in our algorithm. 


Postscript 

Very recently Ibarra and 22ieng [16] have discovered a O(log n) time optimal algorithm 
for shortest paths from any source node on an EREW PRAM in an unweighted 
permutation graph. However, their algorithm does not actually find a shortest path 
tree or BPS tree. 



Chapter 5 


Connected and Biconnected 
Components of a Permutation 
Graph 


5.1 Introduction 

We consider a permutation x of n integers in range [L.n]. On this permutation, 
a graph G{v) may be defined such that integers i and j have an edge in G{'k) iff 
(t — j)(T“*(i) ~ 7r"'(j)) < 0 where 5r“'(i) denotes the position of the integer i in x. 
An undirected graph G = {V^E) is defined to be a permutation graph if there exists 
a TT such that G is isomorphic to G{r) [14]. 

Throughout this chapter, it will be assumed that the permutation graph G = 
(V,j5) has been provided to us in the form of the permutation ir. Based on this 
fact, the terms vertex i and integer i sue used interchangeably. Two common repre- 
sentations of a permutation graph given in this form are the matching diagram and 
the representation of vertices of the graph as points in the plane. In the nnatching 
diagram we have an upper row U which is the sequence of all integers in the range 
[L.n] and a lower row D which is the permutation x. Each integer i in U is joined 
to the position in D where i occurs and, this line also subsequently refers to vertex 
i. Two integers i and j have an edge in G iff their corresponding lines intersect. The 
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matching diagram may be viewed as a bijective mapping between integers in [l..n] to 
their positions in v. 

In the plane, each vertex i may be represented by the co-ordinate pair (7r~^(i), i), 
i.e the y-coordinate is the value of the integer, while the x-coordinate is its position 
in TT. We also refer to this as the point i. From the definition of x point t has an edge 
in G with all points which lie in the south-east and north-west quadrants relative to 
its location in the plane, i.e point p = (xp,yp) has an edge with point q = if 

either (a) Xp < x, and pp > y, or (b) Xp > x, and yp < y,. 

In this chapter, we obtain a parallel algorithm to find the biconnected com- 
ponents of a permutation graph in 0 {logloglogn) time with linear work on the 
COMMON{TOLLRANT) CRCW PRAM or O(logn) time with linear work on the 
CREW PRAM. This improves on the previous O(logn) time, processor hicon- 
nectivity algorithm on the CREW PRAM of [22]. We also obtain an algorithm for 
finding the connected components which runs in ( 9 (logn) time with linear work on 
the CREW PRAM, 0 {logloglogn) time using processors on the COM- 

MON(TOLERANT) CRCW PRAM and 0 (log*n) time with processors on a 
Priority- Write CRCW PR.AM. 

The following routines are referred to frequently in the rest of the chapter; 

1 . Prerix-maxima( minima):- This routine computes the prefix-maxima(minima) 
for an array (aj , 02, ...an) where each a,- is an integer in the range This rou- 
tine takes 0 {Iog log log s) time using logiogidg, processors on the COMMON(TOLERAN 
CRCW PRAM [ 7 ] and O(logn) time using processors on the CREW 
PRAM. Subsequently, we refer to this time as r(n) and the processor count 

as p(n). For the special case when s = this routine takes 0 (log*n) time ^ 
with processors on a Priority- Write CRCW PRAM [ 7 ]. 

2 . Merge:- Given two sorted lists (01,02, ...Cn) and {bi,b2,...bn), where each ai( 6 i) 

is an integer in the range [L.3], this routine merges the two arrays to form 
a sorted array (ci,C2, — ,C2n). This routine runs in O(log log logs) time using 
i oglogioTg CREW PRAM [ 9 ]. 

^log’ n = loglog*"^ n; log' n = min{i : log' n < 1 .} 





5.2 Connected components 

Xiemma 5.1 A connected component in a permutation graph is a consecutive se- 
quence of integers, thus a connected component of size k starting at vertex(integer) 

I contains all vertices in the range + fc)]. 

Proof. We consider a connected component of size k in which the miriiTnnm ver- 
tex(integer) present is L Let j, {I < j < I + k) he the first integer not present in 
the given connected component. We consider the re mai ning (fc — (j — vertices in 
the graph which are greater than j and occur in the same connected component as 
1. At least one of these vertices has to have an edge with (intersect in the matching 
diagram) some vertex in the range — 1)] if all of them lie in the same connected 
component. But, for this to happen, some vertex in one of the two sets necessarily 
intersects{has an edge with) the line j, and therefore, j also lies in the same connected 
component. | 

Thus, partitioning the vertices of a permutation graph into its connected com- 
ponents is equivalent to identifying a set of integers each of which is the ::rst(least) 
member of a connected component. We next state the necessary and suflficient con- 
dition for integer i to start a connected component. 

Lemnna 5.2 An integer i starts a connected component iff there does not exist any 
integer j such that j < i and 7r'~’‘(i) > i 

Proof: We consider the integers i and j, {j < i) in the upper row U of the matching 
diagram. Now, if j maps to a position in D which is greater than or equal to i, this 
implies that 3k such that T~^{k) < i and k > i. This follows from the fact that if 
the i — 1 integers in the range [l..(i — 1)] in U map to i positions in D, there exists 
at least one integer k in D, •K~^{k) < i which can not be mapped to an element in U 
in the range - 1)]. Thus, integer k in L necessarily maps to a position greater 
than the position of t in f/, or fc intersects L In the graph we thus have edges 
and (fc, i) where j <i < k and so, i belongs to a connected component whose starting 
element is less than or equal to which is a contradiction. I 
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Lemma 5.3 The integers s,- which start a connected component can be computed 
in 0{logn) time on the CREW PRAM, or O(logloglogn) time on the COMMON 
CRCW PRAM or 0(log'' n) time on a Priority-write CRCW PRAM with linear work. 

Proof: From Lemma 5.2. for all integers to check the condition as stated, a sufB.x- 
minima computation is required on array r. Integer s,- starts a connected component 
if the sufFix-minimum value at position s,- is equal to a,-. Each element in ir is in the 
range [l.-n] and so, the faster(0(log* n) time) prefix routine can also be applied. 1 
Lemma 5.4 Each element identifies the connected component it belongs to within 
the resource bounds of prefix-maxima routine. 

Proof: We consider the array N which is the sequence of integers [l..n]. Each element 
which knows that it is the starting element of a connected component, marks itself 
as 1 and others mark themselves as 0. From Lemma 5.1, each 0 identifies itself in the 
connected component started by the nearest 1 to its left. ® 

Theorem 5.1 The connected components of a permutation graph can be identified in 
0(log' n) optimal time on a Priority-write CRCW PRAM or, O(logloglogn) optimal 
time on the COMMON(TOLERANT) CRCW PRAM or, O(logn) optimal time on 
the CREW PRAM. 

Proof: The correctness of the algorithm follows from Lemma 5.1 and 5.2. The 
resource bounds follow from Lemmas 5.3 and 5.4. ® 


5.3 Biconnected components 

In this section we consider the representation of each graph vertex as a point in the 
plane. A point p,- = is said to be dominated by another point pj = (xj,l/j) if 

Xj < Xi and Vi > Vi. In the context of a permutation graph, there is an edge between 
integers y.- and y^ in the graph. The set of roots r,- € G is defined to be the set of 
elements such that n is not dominated by any other point in G, i.e, there does not 
exist any element j for which j > r,- and jt (j) < ir 

Lemma 5.5 The set of all n’s can be computed in r(n) time with p(n) processors. 
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Fig 1: Plane representation 

Proof: From the definition of r,-, it follows that in x, r,- does not have any larger ele- 
ment to the left, i.e, r,’s are exactly those elements such that the prefix-maximum(in 
ff) at the location :r“*(ri) equals r,-. ^ 

Lemma 5.6 In the array the values present in locations (ri,r 2 ...) form an in- 
creasing sequence. 

Proof: From the definition of r; it follows that there is no integer less than n and 
having a position greater than r,’ in tt. Equivalently, in no location less than ri 
contains a value greater than the value contained in location r,-. I 

Thus, by construction, S, the set of n’s, is impUcitly ordered both by value and 

position in 

For any point pj = we define the closest dominating root as that which 

dominates pj and is the nearest in y-coordinate among all dominating r,- s. The set 
of all points which have ri as their closest dominating root is termed as Drj. This 
region is shown in Fig 1. 

Lemma 5.T The set consists exactly of all the integers in the range [yn^i-yn]- 
Proof: By definition, the set can not have integers with value greater than or 
equal to yr,-. But if any integer in Dn is less than j/r.-n it is also dominated by ri_i. 
So, it would choose some rj, j < {i - 1) as its closest dominating root. Thus, only 
integers fe, (jfri.i <k <yri) belong to Dn^ ® 



From the definition of jt the elements belonging to Dn thus occur in a contigu- 
ous portion of the array between locations and rj. 

In the set 0 ^ , we now consider points Cij such that each Cjj is dominated by only 
and by no other integer in Equivalently, if we consider region and remove 
r; from the graph, the elements Cij are the roots. In this context(i.e considering Cfj’s 
to be the roots), the regions may be similarly defined i.e, the region Ddj consists 
of all elements which belong to and have c,j as their closest dominating root in 
Dr,. 

Lemma 5.8 The set D^j consists exactly of all the integers in the range 
Proof: The proof Is similar to that of Lemma 5.7. I 

Thus, the elements belonging to each also occur in contiguous locations 
(within the region defined by their corresponding Dn) in 

In the set Dc„ we define lx[ij] to be the integer with largest r-coordinate(i.e 
position), and srfij] to be the integer with second largest x-coordinate. 

Lemma 5.9 The integers lx[ij] and sx[ij] can be computed for all Da^ in -(n) time 
with p{n) processors. 

Proof: From Lemma 5.8, tt"' is partitioned such that Da^ is exactly the set of 
integer.s between cm-i) and c,j. So, lx[ij] is simply the maximum value in the subarray 
7r“'*[(cqj_ij)..(cy)] and can be computed using prefix-maxima routine. I 

In the set Dr, (for some i), corresponding to Cij we define the set Ct{ij) as follows: 
Ci{ij) ~ {cik I k < j and x[cfj] < /x[ifc]}. Intuitively, Ci{ij) consists of Cik's 
for which contains an element (/x[i/:]) dominated by c,j; i.e., there is an edge 
between lx[ik] and Cij in G. Thus, by a result of Liang et.al. [22], the elements of D^j, 
together with elements of Da^ will form a biconnected component. Term hpan[cij] 
will denote the y-coordinate (i.e value) of the minimum element in Ci{ij). Thus, if 
fc' = mm{k 1 Cik € Ci{ij)}, then lspan[cij] = y[Av], 

Lemma 5.10 For each c,j, /span[ci^] may be computed in r(n) time with p{n) pro- 


cessors. 



Proof: For some i we consider the computation of all /span[c.-,]’s. Recall, the el- 
ements Cij are increasing both in value and in position. Since we need to compute 
the first Cik such that lx[ik] > x[cij](x~^(c,j)), an obvious way is to compute the 
prefix-maxima over the \’alues (rc-coordinate) of lx[ik] in and find out the first 
location where the prefix-maxima value is more than x[c,-j] by a binary search on the 
prefix- maxima values. However, we can improve on the resource bounds by noting 
that after computing prefix-maxima we have got a sorted(non-decreasing) array of 
integers in the range [L.n], So, to locate the positions of all Cjj in this array, we 
can merge the two arrays and marking each of the prefix-maximum values as 1 and 
the Cij elements as 0, we just need to compute for each 0 the nearest 1 to the right. 
This will give the identity of the desired c,-^. Now, the regions Dj.^ being disjoint, the 
computation for all of them can be carried out simultaneously. The total sum of the 
number of elements in ail regions is 0(n) only, and so the computation can be 
carried out within the resource bounds of prefix-maxima and merge routines. I 

An interval graph [14] consists of a set of intervals /,• = [a,-, 6,], (a,- < &,) on the real 
line: there is a vertex in the graph for each interval and there is an eng. .ween two 
vertices if their corresponding intervals have a non-empty intersection. We consider 
the set Dr, and construct an interval graph Gi in the following manner: corresponding 
to each c,^ belonging to the region Dn, there is an interval [/span[cij],y[c;j]] .in Gj. 
From [22] the connected components in G,- are biconnected components in the region 
Dr,. Note that in the set D^., the right end-points of the intervals occur in sorted 
order and the left end-points are also sorted. Computing the connected components 
in an interval graph is equivalent to computing prefix-minima after sorting the end- 
points. In this case, sorting of end-points is equivalent to merging of two sorted arrays 
of integers. Thus, the routines prefix-minima and merge together solve the connected 
component problem on each G;. As, the elements belonging to the D^'s (and hence 
G,’s) are disjoint and the total number of intervals is 0(n) only, this computation 
can also be carried out in r(n) time with p{n) processors. 

Once the connected components in G,- have been computed, we retain only those 
Cij’s which are the minimum elements in some connected component and remove the 
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other Cik s along with the elements in regions 

In this reduced graph, corresponding to aj we define the set Cd{ij) as follows: 
C'ci(ij) — {cki ] fc < z and x[c,j] < sx{/:l']}. Intuitively, Cd{ij) consists of ch’s for which 
contains two elements and sx[fc/]) dominated by c,-y; i.e., tbere is an edge 

between and Cjj and also between sx[i&] and Cjj in G. Thus, by a result of Liang 
et.al. [22], the elements of together with elements of D^j will form a biconnected 
component. Term <ispan[cij] will denote the y-coordinate(i.e value) of the TTninimiim 
element in Thus, if k' = min{fc j Cfc, € Ci(zi)} and V = min{/ 1 cyi € Ci{ij)} 

dspanfcij] = . 

Lemma 5.11 For all Cjj’s, dspan[c,j] may be computed in r(n) time with p(n) pro- 
cessors. 

Proof: Consider computation of dspan[cij] for an element cij: in the reduced tt"^, if 
we consider only sx’s, the computation effectively locates the least sx for whidi the 
position in tt is greater than position of c,j in -k. Thus, in reduced t if we compute 
suffix-minima on only sx’s, the suffix-minima value at position 7r“^(c,j) mdicates this 
desired element; hence we can solve the problem within the bounds of prefix-maxima 
routine. I 

From (22), the connected components in G' are biconnected components in the 
given permutation grapli. 

Lemma 5.12 The connected components of G' can be computed in r(n) time with 
p(n) processors. 

Proof: In Gi, both the end-points were sorted, but in G' only the right end-points 
are sorted- However, in an interval graph, if any one of the end-points are sorted, the 
connected components may be computed using prefix-maxima [28]. I 

Thus we have the following theorem. 

Theorem 5.2 The biconnected components of a permutation graph can be computed 
in O(logn) time using processors on the CREW PRAM or O(log log log n) time 
using processors on the COMMON(TOLERANT) CRCW PRAM. 

Proof: From the previous discussion, the costliest steps needed in the algorithm are 
prefix-innaxima and merge which give rise to the stated resource boimds. I 
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Concluding Remarks 


In Chapter 2, constant time parallel algorithms have been obtained for some prob- 
lems in interval graphs on the reconfigurable mesh model. However, there axe some 
problems in interval graphs which can not be solved in constant time, for e.g. finding 
the shortest path between a pair of vertices, even with O(n^) processors. List-ranking 
within the stated resource bounds appears to be a bottleneck. . 

For the most vital edge problem considered in Chapter 3, the time taken by 
the sequential algorithm is 0(min(m + nlogn, 7nQ(m,n)) while the time-processor 
product of the parallel algorithms is 0(min(n^,m logm)). Hence there is still some 
scope for further improvement. 

In Chapter 4, the BFS algorithm runs optimally in O(logn) time on the EREW 
PRAM which is the weakest PRAM model- on this model, as logn is a lower bound 
for most problems, there appears to be little scope for improvement. However, on 
some other model, a faster parallel algorithm may be possible. For the DFS algo- 
rithm, as repeatedly mentioned, the bottleneck is computation of longest increasing 
subsequence. Obtaining an optimal parallel algorithm matching the lower bound of 
O(nloglogn) is an open problem. 

In Chapter 5, the connected and biconnected components for a permutation graph 
are computed optimally. The algorithm for biconnected components could possibly 
be extended to optimally compute the fc-connected components of a permutation 
graph- 
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